STIC Biotechnology Systems Branch 



RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Number: 
Source: 

Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION AND PATENTIN SOFTWARE QUESTIONS, PLEASE CONTACT 
MARK SPENCER, TELEPHONE: 571-272-2510; FAX: 571-273-0221 



TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 4.4.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker/chkrnote.htm 



Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 

Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: Commissioner for Patents, P.O. Box 1450, Alexandria, VA 223 13-1450 

3. Hand Carry, Federal Express, United Parcel Service, or other delivery service (EFFECTIVE 01/14/05): 
U.S. Patent and Trademark Office, Mail Stop Sequence, Customer Window, Randolph Building, 401 Dulany Street, 
Alexandria, VA 22314 

Revised 01/10/06 



Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION SERUL NUMBER: 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 

1 Wrapped Nucleics The number/text at the end of each line "wrapped" dowTi to the next line. This may occur if your file 

Wrapped Aminos was retrieved in a word processor after creating it. Please adjust your right margin to 3; this will 
prevent "wrapping." 



Invalid Line Length The rules require that a line not exceed 72 characters in length. This includes white spaces. 



^Misaligned Amino The numbering under each 5^ amino acid is misaligned. Do not use tab codes between numbers; 
Numbering use space characters, instead. 



Non-ASCII 



The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submissipn is saved iniASCII text. 
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Variable Length 



Patentin 2.0 
"bug" 



™ce(s)^ ^contain n's or Xaa's representinj 



Sequence(s ) c ontain n's or Xaa's representing more than one residue. Per Sequence Rules, 
each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug" in Patentin version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences(s) . Normally, Patentin would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 



Skipped Sequences 
"(OLD RULES) 



Sequence(s) _ 



missing. If intentional, please insert the following lines for each skipped sequence: 



(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include the skipped sequences. 

missing. If intentional, please insert the following lines for each skipped sequence. 



_Skipped Sequences Sequence(s) 

(NEW RULES) <2 1 0> sequence id number 

<400> sequence id number 

000 



9 Use of n's or Xaa's 

(NEW RULES) 

10 Invalid <213> 

Response 

11 Useof<220> 



Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. • 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
is Artificial Sequence 



Sequence(s) _ 



_ missing the <220> "Feature" and associated numeric identifiers and responses. 



_PatentIn 2.0 
""bug" 



Use of <220> to <223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 

"Unknown." Please explain source of genetic material in <220> to <223> section. 

(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" ftinction of Patentin version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 
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Misuse of n/Xaa "n" can only represent a single nucleotide: "Xaa" can only represent a single amino acid 



AMC - Biotechnology Systems Branch - 09/09/2003 



RAW SEQUENCE LISTING DATE: 02/06/2006 

PATENT APPLICATION: US/10/538,041 TIME: 14:24:06 



Input Set : A:\Sequence Listing for USSN 10-538041.txt 
Output Set: N:\CRP4\02012006\J538041.raw 

4 <110> APPLICANT: TOOLGEN, Inc. 

6 <120> TITLE OF INVENTION: Regulatory Zinc Finger Proteins 
8 <130> FILE REFERENCE: Q88285 

10 <140> CURRENT APPLICATION NUMBER: US 10/538,041 . yj 

11 <141> CURRENT FILING DATE: 2005-06-08 {rT O 

13 <150> PRIOR APPLICATION NUMBER: US 60/431,892 "^ffu^ 

14 <151> PRIOR FILING DATE: 2002-12-09 U " 
16 <160> NUMBER OF SEQ ID NOS: 129 '^OCS Af 

18 <170> SOFTWARE: Patentin version 3.2 Corf ^ ^OltlQl^ 

20 <210> SEQ ID NO: 1 Ofefr #* 

21 <211> LENGTH: 23 ® '^©edlV* 

22 <212> TYPE: PRT ^ 

23 <213> ORGANISM: Homo sapiens 
25 <400> SEQUENCE: 1 

27 Tyr Lys Cys Lys Gin Cys Gly Lys Ala Phe Gly Cys Pro Ser Asn Leu 

28 1 5 10 15 

30 Arg Arg His Gly Arg Thr His 

31 20 

34 <210> SEQ ID NO: 2 

35 <211> LENGTH: 23 

36 <212> TYPE: PRT 

37 <213> ORGANISM: Homo sapiens 
39 <400> SEQUENCE: 2 

41 Tyr Ser Cys Gly lie Cys Gly Lys Ser Phe Ser Asp Ser Ser Ala Lys 

42 1 5 10 15 

44 Arg Arg His Cys lie Leu His 

45 20 

48 <210> SEQ ID NO: 3 

49 <211> LENGTH: 23 

50 <212> TYPE: PRT 

51 <213> ORGANISM: Homo sapiens 
53 <400> SEQUENCE: 3 

55 Tyr Thr Cys Ser Asp Cys Gly Lys Ala Phe Arg Asp Lys Ser Cys Leu 

56 1 5 10 15 

58 Asn Arg His Arg Arg Thr His 

59 20 

62 <210> SEQ ID NO: 4 

63 <211> LENGTH: 23 

64 <212> TYPE: PRT 

65 <213> ORGANISM: Homo sapiens 
67 <400> SEQUENCE: 4 

69 Tyr Lys Cys Gly Gin Cys Gly Lys Phe Tyr Ser Gin Val Ser His Leu 

70 1 5 10 15 



file://C:\CRF4\Outhold\VsrJ538041.htm 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/538,041 



DATE: 02/06/2006 
TIME: 14:24:06 



Input Set : A:\Sequence Listing for USSN 10-538041.txt 
Output Set: N:\CRF4\02012006\J538041.raw 



72 Thr Arg His Gin Lys He His 

73 20 



76 <210> SEQ ID NO: 5 

77 <211> LENGTH: 23 

78 <212> TYPE: PRT 

79 <213> ORGANISM: Homo sapiens 
81 <400> SEQUENCE: 5 

83 Tyr Lys Cys Glu Glu Cys Gly Lys Ala Phe Arg Gin Ser Ser His Leu 

84 1 5 10 15 

86 Thr Thr His Lys He He His 

87 20 

90 <210> SEQ ID NO: 6 

91 <211> LENGTH: 23 

92 <212> TYPE: PRT 

93 <213> ORGANISM: Homo sapiens 
95 <400> SEQUENCE: 6 

97 Tyr Glu Cys Glu Lys Cys Gly Lys Ala Phe Asn Gin Ser Ser Asn Leu 

98 1 5 10 15 

100 Thr Arg His Lys Lys Ser His 

101 20 

104 <210> SEQ ID NO: 7 

105 <211> LENGTH: 23 

106 <212> TYPE: PRT 

107 <213> ORGANISM: Homo sapiens 
109 <400> SEQUENCE: 7 

111 Tyr Val Cys Ser Lys Cys Gly Lys Ala Phe Thr Gin Ser Ser Asn Leu 

112 15 10 15 

114 Thr Val His Gin Lys He His 

115 20 

118 <210> SEQ ID NO: 8 

119 <211> LENGTH: 23 

120 <212> TYPE: PRT . 

121 <213> ORGANISM: Homo sapiens 
123 <400> SEQUENCE: 8 

125 Tyr Lys Cys Pro Asp Cys Gly Lys Ser Phe Ser Gin Ser Ser Ser Leu 

126 15 10 15 

128 He Arg His Gin Arg Thr His 

129 20 

132 <210> SEQ ID NO: 9 

133 <211> LENGTH: 25 

134 <212> TYPE: PRT 

135 <213> ORGANISM: Homo sapiens 
137 <400> SEQUENCE: 9 

139 Tyr Val Cys Asp Val Glu Gly Cys Thr Trp Lys Phe Ala Arg Ser Asp 

140 15 10 15 

142 Glu Leu Asn Arg His Lys Lys Arg His 

143 20 25 

146 <210> SEQ ID NO: 10 

147 <211> LENGTH: 23 



file://C:\CRF4\Outhold\VsrJ538041 .htm 
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RAW SEQUENCE LISTING DATE: 02/06/2006 

PATENT APPLICATION: US/10/538,041 TIME: 14:24:06 

Input Set : A:\Sequence Listing £or USSN 10-538041.txt 
Output Set: N:\CRF4\02012006\J538041.raw 

148 <212> TYPE: PRT 

149 <213> ORGANISM: Homo sapiens 
151 <400> SEQUENCE: 10 

154 Phe Gin Cys Lys Thr Cys Gin Arg Lys Phe Ser Arg Ser Asp His Leu 

155 15 10 15 

157 Lys Thr His Thr Arg Thr His 

158 20 

161 <210> SEQ ID NO: 11 

162 <211> LENGTH: 23 

163 <212> TYPE: PRT 

164 <2i3> ORGANISM: Homo sapiens 
166 <400> SEQUENCE: 11 

168 Tyr Lys Cys Met Glu Cys Gly Lys Ala Phe Asn Arg Arg Ser His Leu 

169 1 5 10 , 15 

171 Thr Arg His Gin Arg He His 

172 20 

175 <210> SEQ ID NO: 12 

176 <211> LENGTH: 23 

177 <212> TYPE: PRT 

178 <213> ORGANISM: Homo sapiens 
180 <400> SEQUENCE: 12 

182 Tyr He Cys Arg Lys Cys Gly Arg Gly Phe Ser Arg Lys Ser Asn Leu 

183 15 10 15 

185 He Arg His Gin Arg Thr His 

186 20 

189 <210> SEQ ID NO: 13 

190 <211> LENGTH: 23 

191 <212> TYPE: PRT 

192 <213> ORGANISM: Homo sapiens 
194 <400> SEQUENCE: 13 

196 Tyr Glu Cys Asp His Cys Gly Lys Ala Phe Ser Val Ser Ser Asn Leu 

197 15 10 15 

199 Asn Val His Arg Arg He His 

200 20 

204 <210> SEQ ID NO: 14 

205 <211> LENGTH: 23 

206 <212> TYPE: PRT 

207 <213> ORGANISM: Homo sapiens 
209 <400> SEQUENCE: 14 

211 Tyr Thr Cys Lys Gin Cys Gly Lys Ala Phe Ser Val Ser Ser Ser Leu 

212 15 10 15 

214 Arg Arg His Glu Thr Thr His 

215 20 

218 <210> SEQ ID NO: 15 

219 <211> LENGTH: 23 

220 <212> TYPE: PRT 

221 <213> ORGANISM: Homo sapiens 
223 <400> SEQUENCE: 15 

225 Tyr Glu Cys Asn Tyr Cys Gly Lys Thr Phe Ser Val Ser Ser Thr Leu 
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RAW SEQX7ENCE LISTING 

PATENT APPLICATION: US/10/538,041 



DATE: 02/06/2006 
TIME: 14:24:06 



Input Set : A:\Sequence Listing for USSN 10-538041.txt 
Output Set: N:\CRF4\02012006\J538041.raw 



226 1 5 

228 He Arg His Gin Arg He His 

229 20 



10 



15 



232 <210> SEQ ID NO: 16 

233 <211> LENGTH: 23 

234 <212> TYPE: PRT 

235 <213> ORGANISM: Homo sapiens 
237 <400> SEQUENCE: 16 

239 Tyr Arg Cys Glu Glu Cys Gly Lys Ala Phe Arg Trp Pro Ser Asn Leu 

240 15 10 15 

242 Thr Arg His Lys Arg He His 

243 20 

246 <210> SEQ ID NO: 17 

247 <211> LENGTH: 23 

248 <212> TYPE: PRT 

249 <213> ORGANISM: Homo sapiens 
251 <400> SEQUENCE: 17 

254 Tyr Glu Cys Asp His Cys Gly Lys Ser Phe Ser Gin Ser Ser His Leu 

255 15 10 15 

257 Asn Val His Lys Arg Thr His 

258 20 

261 <210> SEQ ID NO: 18 

262 <211> LENGTH: 23 

263 <212> TYPE: PRT 

264 <213> ORGANISM: Homo sapiens 
266 <400> SEQUENCE: 18 

268 Phe Leu Cys Gin Tyr Cys Ala Gin Arg Phe Gly Arg Lys Asp His Leu 

269 15 10 15 

271 Thr Arg His Met Lys Lys Ser 

272 20 

275 <210> SEQ ID NO: 19 

276 <211> LENGTH: 24 

277 <212> TYPE: PRT 

278 <213> ORGANISM: Artificial 

280 <220> FEATURE: 

281 <223> OTHER INFORMATION: Artificial zinc finger domain 
283 <400> SEQUENCE: 19 

285 Tyr Arg Cys Lys Tyr Cys Asp Arg Ser Phe Ser Asp Ser Ser Asn Leu 

286 15 10 15 

288 Gin Arg His Val Arg Asn He His 

289 20 

292 <210> SEQ ID NO: 20 

293 <211> LENGTH: 83 

294 <212> TYPE: PRT 

295 <213> ORGANISM: Artificial 

297 <220> FEATURE: 

298 <223> OTHER INFORMATION: artificial zinc finger protein 
300 <400> SEQUENCE: 20 

302 Tyr Lys Cys Gly Gin Cys Gly Lys Phe Tyr Ser Gin Val Ser His Leu 
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Pages 



RAW SEQUENCE LISTING DATE: 02/06/2006 

PATENT APPLICATION: US/10/538,041 TIME: 14:24:06 

Input Set : A:\Sequence Listing for USSN 10-538041.txt 
Output Set: N:\CRF4\02012006\J538041.raw 



303 


1 


5 










10 








15 




304 


Thr Arg 


His Gin Lys 


He 


His 


Thr Gly 


Glu 


Lys 


Pro 


Phe 


Gin Cys 


Lys 


305 




20 








25 










30 




307 


Thr Cys 


Gin Arg Lys 


Phe 


Ser Arg 


Ser 


Asp His 


Leu 


Lys 


Thr His 


Thr 


308 




35 






40 










45 






310 


Arg Thr 


His Thr Gly 


Glu 


Lys 


Pro 


Tyr 


ile 


Cys Arg 


Lys 


Cys Gly 


Arg 


311 


50 






55 










60 








313 


Gly Phe 


Ser Arg Lys 


Ser 


Asn 


Leu 


He 


Arg 


His 


Gin Arg Thr His 


Thr 


314 


65 




70 










75 








80 


316 


Gly Glu 


Lys 























320 <210> SEQ ID NO: 21 

321 <211> LENGTH: 83 

322 <212> TYPE: PRT 

323 <213> ORGANISM: Artificial 

325 <220> FEATURE: 

326 <223> OTHER INFORMATION: artificial zinc finger protein 
328 <400> SEQUENCE: 21 

330 Tyr Lys Cys Glu Glu Cys Gly Lys Ala Phe Arg Gin Ser Ser His Leu 

331 1.5 10 15 

333 Thr Thr His Lys lie Ile His Thr Gly Glu Lys Pro Tyr Lys Cys Met 

334 20 25 30 

336 Glu Cys Gly Lys Ala Phe Asn Arg Arg Ser His Leu Thr Arg His Gin 

337 35 40 45 

339 Arg lie His Thr Gly Glu Lys Pro Phe Gin Cys Lys Thr Cys Gin Arg 

340 50 55 60 

342 Lys Phe Ser Arg Ser Asp His Leu Lys Thr His Thr Arg Thr His Thr 

343 65 70 75 80 
345 Gly Glu Lys 

349 <210> SEQ ID NO: 22 

350 <211> LENGTH: 83 

351 <212> TYPE: PRT 

352 <213> ORGANISM: Artificial 

354 <220> FEATURE: 

355 <223> OTHER INFORMATION: artificial zinc finger protein 
357 <400> SEQUENCE: 22 

359 Tyr Lys Cys Gly Gin Cys Gly Lys Phe Tyr Ser Gin Val Ser His Leu 

360 15 10 15 

362 Thr Arg His Gin Lys Ile His Thr Gly Glu Lys Pro Phe Gin Cys Lys 

363 20 25 30 

365 Thr Cys Gin Arg Lys Phe Ser Arg Ser Asp His Leu Lys Thr His Thr 

366 35 40 45 

368 Arg Thr His Thr Gly Glu Lys Pro Tyr Lys Cys Met Glu Cys Gly Lys 

369 50 55 60 

371 Ala Phe Asn Arg Arg Ser His Leu Thr Arg His Gin Arg lie His Thr 

372 65 70 75 80 
374 Gly Glu Lys 

378 <210> SEQ ID NO: 23 

379 <211> LENGTH: 83 

380 <212> TYPE: PRT 
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<210> 123 

<211> 21 

<212> PRT 

<213> Artificial 

<220> 

<223> zinc finger consensus 



<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
(2) . . (2) 
any amino acid 



<220> 

< 2 2 1 > MI SC_FEATURE 
<222> (3).. 

<223> /filetween 1 (and 4 "^mino acids of any amino acic 



<220> 

<221> MISC_FEATURE 

<222> (20).. (20) y^^/^ \^ 

<223> between one >and three] residues of any amino acid 

<400> 123 ^ 




RAW SEQUENCE LISTING ERROR SUMMARY 
PATENT APPLICATION: US/10/538,041 



DATE: 02/06/2006 
TIME: 14:24:07 



Input Set : A:\Sequence Listing for USSN 10-538041.txt 
Output Set: N:\CRF4\02012006\J538041.raw 



Please Note; 

Use o£ n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:123; Xaa Pes. 2,3,5,6,7,8,9,10,11,12,13,14,15,16,18,19,20 
Seq#:124; Xaa Pes. 2,3,5,6,7,8,9,11,14,15,18,19,20 
Seq#:125; Xaa Pes. 3,4,6 



Invalid <213> Response^t^ ^ 
Use of "Artificial" onl^as " <2 13 > Organism" response is incomplete/ 
per l«823(b) of New Sequence RulM^ Valid response is Artificial Sequence. 




Seq# : 19, 20, 21, 22, 23, 24, 25 J3«y^27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42 
Seq# : 43, 44, 45, 46, 47, 48^^^^^50^^, 52, 53,54, 55, 56,57,58,59,60,61,62,63,6 
Seq#: 67, 68, 69, 70, 71, 1^,124,125X126,127,128 
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VERIFICATION SUMMIOIY DATE: 02/06/2006 

PATENT APPLICATION: US/10/538,041 TIME: 14:24:07 



Input Set : A:\Sequence Listing for USSN 10-538041.txt 
Output Set: N:\CRF4\02012006\J538041.raw 

L:3279 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:123 after pos.:0 
M:341 Repeated in SeqNo=123 

L:3304 M:283 W: Missing Blank Line separator, <220> field identifier 
L:3351 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:124 after pos . : 0 
M:341 Repeated in SeqNo=124 

L:3384 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:125 after pos . : 0 
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